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ABSTRACT 

We demonstrate that the space formed by the star-formation rate (SFR), gas-phase metallicity (Z), 
and stellar mass (M*), can be reduced to a plane, as first proposed by Lara-Lopez et al. We study three 
different approaches to find the best representation of this 3D space, using a principal component analysis, 
a regression fit, and binning of the data. The PCA shows that this 3D space can be adequately represented 
in only 2 dimensions, i.e., a plane. We find that the plane that minimises the^ 2 for all variables, and hence 
provides the best representation of the data, corresponds to a regression fit to the stellar mass as a function 
of SFR and Z, M*=/(Z,SFR). We find that the distribution resulting from the median values in bins for 
our data gives the highest^- 2 . We also show that the empirical calibrations to the oxygen abundance used 
to derive the Fundamental Metallicity Relation (Nagao et al.) have important limitations, which contribute 
to the apparent inconsistencies. The main problem is that these empirical calibrations do not consider the 
ionization degree of the gas. Furthermore, the use of the N2 index to estimate oxygen abundances cannot 
be applied for 12+log((D/H) > 8.8 because of the saturation of the [N 11] /16584 line in the high-metallicity 
regime. Finally we provide an update of the Fundamental Plane derived by Lara-Lopez et al. 
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Stellar mass (M*), metallicity (Z), and star-formation 
rate (SFR) are key galaxy properties. M* reflects the 
amount of gas locked up in stars over a galaxy's his- 
tory. SFR indicates the current rate at which gas is 
being converted into stars. Z reflects the gas repro- 
cessed by stars over the course of stellar evolution, 
and any exchange of gas between the galaxy and the 
environment. The relationships between these three 
properties are fundamental in understanding galaxy 
evolution. Indeed, models of galaxy formation within 
the A-CDM scenario already include chemical hy- 



drodynamic simulations (e.g.. iDe Lucia et ail 120041; 

Tissera et al 2005; De Rossi et al. 2006; Da ve & Oppenheimen 



20071; iMartmez-Serrano etail 2008) 

In the last few years it has been found that M*, 
Z, and SFR are strongly interrelated. Analyzing 
galaxy measure ments from the Slo an Digital Sky 
Survey (SDSS), lEllison et all d2008h found that the 
mass-metallicity (M*-Z) relation for star-forming 
(SF) galaxies depends on the SFR. Subsequently, 
Lara-Lopez et alj d2010al) reported the existence of a 
Fundamental Plane (FP) between these three param- 
eters. These authors confirmed that the M+-Z and 
M*-SFR relations are just particular cases of a more 
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general relationship. lLara-Lopez et al.l (1201 Oal) fitted 
a plane and derived an expression for the stellar mass 
as a function of the gas metallicity and SFR (the Fun- 
damental Plane, FP). In a parallel and independent 
study, using the same SPSS data , but d ifferent Z and 
SFR estimations, Mannucci et al.l d2010b found a sim- 
ilar fundamental relationship, but instead expressed 
Z as a combination of M* and SFR with a substan- 
tially different quantitative relationship. They refer to 
this correlation as the Funda mental Meta l licity Rela- 
tion (FMR). In a recent study. I Yates et al.l (120120 used 
models and SDSS data to analyze the dependences of 
different combinations be tween SFR, Z and M*. They 
found that the fit given by Mannucci et al. (2010) does 
not significantly reduce the dispersion of the metallic- 
ity compared to the M*-Z relation. 

A fundamental requirement in all these analyses is 
obtaining a reliable estimation of the galaxy metallic- 
ity. The most robust method to derive the metallicity 
in SF galaxies is via the estimate of metal abundances 
and abundance ratios, in particular through the deter- 
mination of the gas-phase oxygen abundance. This is 
typically achieved through the analysis of emission- 
line spectra of Hn regions within the galaxies. A 
proper determination of the oxygen abundance re- 
lies on the detectio n of the \Q m] ,14363 auroral lin e 
(the T e method, e.g. jLopez-Sanchez & Esteban 2009), 
but this emission line is usually not observed because 
of its faintness. Consequently, it is common to in- 
voke the so-called strong-line methods. These tech- 
niques assume that the oxygen abundance of an Hn 
region can be derived using only a few bright emis- 
sion lines. Empirical calibrations based on photoion- 
ization models, however, systematically over-predict 
by 0.2-0.6 dex the oxygen abundances derived using 
the T e met hod and those ca l ibrations which are base d 
on it (see I Yin et all l2007t iKewlev & Ellison! [200: 



Bresolin et al. 2009; Lopez-Sanchez & Estebanll201 
Moustakas et al.l 2010t lLopez-Sanchez et al.l l2012h . 
We discuss these issues further in §[3]below. 

Here we explore three different approaches to the 
representation of the three-dimensional distribution of 
M*, SFR, and Z for galaxies. We detail our sample se- 
lection in §[2] and review some issues with metallicity 
estimators in §[3] The analysis is presented in §[4] and 
we explore the implications for relationships between 
SFR and Z in §|5] We present a discussion of the out- 
come of our analysis, and summarise our results, in 
§0 



2. Sample selection 

We use data from the Slo an Digital Sky Surve 
Data Release 7 (SDSS-DR7. [Abazaiian et alJ EolF 



Adelman-McCarthv et al J 120071) . using the emission 



'ey 

)9; 



line analysis performed by the MPA-JHU groupQ. 
From the SDSS-DR7 databas e, we selected gala xies in 
the "main galaxy sample" dStrauss et alJ l2Q02h . with 
apparent Petrosian r magnitude of 14.5 < m r < 11.11 
and in the redshift range 0.04 < z < 0.33. The lower 
limit ensures that at least 20% of the galaxy light will 
be inside the 3" of the SDSS fiber, whic h is the mini- 
mum required to avoid aperture effects dKewley et al. 



2005). To ensure reliable metallicities, we imposed a 
minimum signal-to-noise ratio, SNR > 8, for each 
of the most prominent lines Ha, H/J, [Nn]/16584, 
[O in] ,15007, and [On] ,13728. The specific SNR 
threshold chosen turns out not to be critical to the re- 
sults. The impact of SNR choice, and the selection of 
lines it is imposed upon, is explored in detail by Fos- 
ter et al. (in prep.), who show that for SNRs between 
about 3 and 8, the resulting mass-metallicity relation 
does not change substantially. For higher SNR thresh- 
olds, though, the weaker (often lower SNR) lines may 
be excluded, leading to a reduced sensitivity to the 
high-metallicity population. 

We selected SF gala xies f ollowing the criteria of 



Kauff mann et al.l d2003l) on the Baldwin, Phi llips & Terlevich 



(119811) diagram, log([0 m] ,15007 / H/3) vs. log([N n] ,16584 / Hor). 
The above criteria give us a sample of 45 475 galaxies. 

We use gas-phase metallicities, total SFRs, and stel- 
lar masses derived by the MPA-JHU group. Metal- 
licities were estimated through a Bayesian approach 
based on simultaneous fits to all the most p romi- 
nent lines according to iTremonti et al .1 (120041 here- 
after, T04), while total SFRs were obtained from 



Brinchmann et al. (2004). Aperture effects are again 



an important issue to con si der in the es t imate of SFR 



dBrinchmann et al.l EooH) . lYates et all d2012h shows 



that if SFRs are calculated without making an aper- 
ture correction, nearby galaxies will have their SFRs 
underestimated, and this leads to a reduced spread in 
the SFR distribution for low mass, low redshift galax- 
ies. This will have an impact on any exploration of 
the SFR dependence of t he M+-Z relatio n. Both 



Trem onti et alJ d2004l) and Brinchma nn et al.l (12004) 



make use of the po pulation synthesis and photo ioniza- 
tion codes given bv lCharlot & Longhettild200ll) . Stel- 



'http://www.mpa-garching.mpg.de/SDSS 
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lar masses wer e estimated from fits to the photometry 



as described in Kauffmann et al. (2003). 



3. Metallicity estimate issues 

Since the calculation of metallicity is a particu- 
larly challenging process, the most accurate and re- 
liable measurements available should be used when- 
ever possible. Approximating metallicity estimates 
through simple parameterisations can be a valuable 
tool when only limited information is at hand, but such 
approximations have significant limitations and uncer- 
tainties. For example, the N2 = log([Nn] /i6584/ Ha) 
<mdR 2i = ([On]/l3727+ [O m] ^4959,5007)/H/? pa- 
rameters can be used to estimate metallicity with the 



be appreciated in Fig.QJ, which shows the relationship 
between the T04-derived Z and the N2 parameter. It is 
clear that the N2 index saturates for 12+log(0/H) > 

8.8. 

Thi s agrees with the result found by lYin et al 
(12007b . who concluded that empirical calibrations in- 
volving the N2 parameter are only valid for galaxies 
with 12+log(0/H)<8.5 -when comparing with T e - 
based abundances; this limit should be 8.7-8.9 when 
Z has been der ived using photoionization m odels, as 
in our case, see 



Lopez-Sanchez et alj d2012l) — . 



The use of the N2 method in a metallicity regime 
for which it is not valid should therefore be avoided. 
Even when this method is bein g averaged with another 



tven wh en this method is being 
(e.g. i ^, lMannucci et alfeoid 



Nag ao. Maiolino & Marconi 



empirical calibrations of Nagao, Maio lino & Marconi 
(2006, hereafter, N06). The N06 calibrations are cu- 
bic fits between the R23 or the N2 parameter and the 

T04-derived metallicity (see Fig.[T|). 

I ; ipendence on the M+-Z relation using the N06 and 

It i s well known however (e.g. | Baldw in. PhilliDs^lerlevich T04 metallicities It can be appreciated from that 

1 98 l;McGaugh 1991; Kewlev &Dopita 2002; Pilvugin&Thuan tU t t , , , ^ D j- ■ ,u 

ngure that the dependence on SFR disappears in the 



2006), this can still drastically affect the resulting de- 

Affr, and Z. For example, 
(120121) shows the SFR de- 



pendencies between SFR 
Figure 1 of lYates et al 



2005 



Lopez-Sanchez & Esteba n 2010; Lopez-Sanchez et alj 



2011) that the degree of ionization of the gas plays a 
fundamental role in deriving a reliable estimation of 
the oxygen abundance. The most common empiri- 
cal calibration s based on either a direct determina- 
tion of the T e (Pilyugin & Thuan 2005; Pilvugin et al 



2010) or photoionization models ( Kewlev & Dopita 



2002; Kobulnicky & Kewlevl 12004 iTremonti et al. 



2004) do consider an ionization parameter. Without 



such information, the uncertainty of the derived oxy- 
gen abundances may be as large as 0.25 - 0.40 dex 
dLopez-Sanchez & Estebanll2010l) . 

Indeed, even the T e method itself is subject to un- 
certainties, as temperature fluctuations and gradients 
can render the T e method incorrect by up to 0.4 dex 

, Peimbert et alj 



high metallicity regime when the N06 metallicities are 
used. This lack of dependence can be explained by 
the saturation of the N2 method in the high metallicity 
regime. On the other hand, when the T04 metallici- 
ties are used, this dependence is observed over the full 
metallicity range. To further support this observation, 
Lara-Lopez et al. (in preparation) analyzed the same 
dependen ce using several combinations of metallic- 
ity (e.g.. |Pettini&Pagelll2004t ITremonti et all [2 004; 
Kewlev & Dopitall2002h and SF R (e.g. iHopkins eTal 



2003; Brinch mann et al. 2004) indicators, obtaining 



Esteban et al 



2007; 



2009; 



(e.g., Lopez-Sanchez et al]|2007t 
Garcfa-Roias & Esteban! [20071; 
Mesa-Delgado & Esteban! |20 101) . and absolute metal- 
licity estimates ar e challenging to make at b est (see 



recent review by lLopez-Sanchez et alJ 120121) . Rel- 
ative estimates, thou gh, can be made to r easonably 



high accuracy (e.g., ITremonti et alj [2004), as long 



as a consistent estimator is being used. As a conse- 
quence, the simplified fits of N06 should be avoided 
when sufficient emission lines are available to make 
a more reliable and direct estima te of the metallicity 
dLopez-Sanchez & Estebanll2010l) . 

Furthermore, the use of the N2 parameter to derive 
Z is not valid in the high metallicity regime. This can 



for all the possible combinations a strong dependence 
of SFR in the M+-Z relation for the full metallicity 
range. 

4. A 3D analysis of the M*, Z, and SFR space 

Here we aim to identify the most compact represen- 
tation of the data distribution in the 3D space of M*, Z 
and SFR. We examine three methodologies: (/) fitting 
a plane to the 3D distributio n using PCA, (ii) fittin g 
a plane through regression dLara-Lopez et al. 2010a), 
and (Hi) binning in SFR and M» to obtain th e median 
metallicity in each bin (Mannucci et alJEoiOh . 

As we want to perform a self-consistent compari- 
son of the results, and because of th e metallicity is- 



sues described in §[3j we do not use the Man nucci et al. 



d2010l) method to derive Z, in the third approach men- 
tioned above. Rather we test all the approaches self- 
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Fig. 1. — Panels a and b compare the log(/?23) and N2 parameters with the T04 metallicitie s, respectively. T he blue 
line gives the N06 calibrations, while the red dashed line indicates the limit imposed by lMannucci et alJ ( 2010h to then- 
cubic fits. Panels c and d show the metallicity obtained by the N06 calibrations using the log(/?23) and N2 indexes 
vs. the T04 metallicities, respectively. Panel e shows the mean of the N06 metallicities when the difference between 
log(7?23) and N2 is less than 0.25 dex. The bottom right panel shows the residuals of panels c, d and e. We emphasize 
that the residuals show a scatter comparable with the intrinsic width of the M - Z relation. 
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consistently using the MPA-JHU measurements de- 
tailed §|2] 

Throughout this section we use the reduced chi- 
squared as a measure of goodness of fit, defined by 
the following equation: 



2 

X red 



(Oi-Ed 1 



v ^— ' cr^- 

where O and E are the observed data and model esti- 
mate, respectively. The value v is the number of de- 
grees of freedom given by v = N — n— 1, where TV is 
the number of observations, and n is the number of fit- 
ted parameters, a 2 is the variance of the observations 
defined by: 



N 

where /u is the mean of the observed data. 



(2) 



4.1. PCA analysis 

We performed a principal component analysis 
(PCA) to identify the underlying dimensionality of 
the three observables. PCA is a mathematical pro- 
cedure that converts a set of observations of possibly 
correlated variables into a set of uncorrected variables 
called principal components. One of the goals of PCA 
is to reveal hidden structure in a dataset, as well as to 
reduce the dimensionality of the data. A high correla- 
tion between variables is an indicator of high redun- 
dancy in the data, while the most important and inde- 
pendent variables are those that account for the largest 
variance. Since high correlation is a mark of high re- 
dundancy, the principal components s hould have lo w 
or even zero correlation between them ( Shlensl l2009). 

We find that the first two principal components ac- 
count for 86% and 12% of the variance, which in- 
dicates that 98% of our data can be explained in a 
2 dimensional space. As a result of the PCA pro- 
cedure we obtain 3 eigenvectors, referred to here as 
compl, comp2, comp3, which are expressed as the 
vector of coefficients of the three parameters, x=M+, 
y=12+log(0/H), and z=log(SFR). The first principal 
component indicates the direction of the highest vari- 
ance, and is given by comply (0.7140,0.1679,0.6796), 
while the second and third components account for 
the highest possible variance in orthogonal direc- 
tions: comp2= (0.5952,0.3654,-0.7156), comp3= 
(0.3686,-0.9155,0.1609), see Fig. [2] It is important 
to note here that one of the weak points of PCA is that 



it relies on the covariance matrix, which is less robust 
against outliers than other methods. We return to this 
point in the discussion below. 

The plane obtained through PCA is given by: 

a [MJM e ] +J3 [12 + log(0/H)] + y [log(SFR)] = 6 (3) 

where a = 0.3686, /? = -0.9155, y = -0.1609, 5 = 
-4.5578 

Solving Eq. [3] in turn for each of M*, Z and SFR, 
as a function of the other two, we obtain the relations 
shown in Fig. [3] Although PCA gives an acceptable re- 
sult in reproducing the M* W\t\\x 2 red = 0.34, the metal- 
licity and especially the SFR present a \\\g\\x 2 red of 1.0 
and 0.94, respectively, (Fig. [3). It can also be appreci- 
ated in Fig.[3]that the fit provided by PCA is less effec- 
tive at capturing the shape of the distribution, in partic- 
ular at the low mass and metallicity end. This is a con- 
sequence of the covariance matrix sensitivity to outly- 
ing data points. This sensitivity effectively pushes the 
principal components to try to represent extremes in 
the data that are not representative of the bulk of the 
measurements. 

4.2. Regression 

We next explore regression to represent our 3D data 
distribution. Regression aims to explain one variable 
in terms of the others, and uses robust methods that 
are less affected by outlying observations. Using re- 
gression we first fit a plane to M* as a function of SFR 
and Z, obtaining: 



log(M,/M Q ) = a m [12 + log(0/H)] + m [log(SFR)] + y m 

(4) 

where a m = 1.3824, f3 ln = 0.5992, y m = -2.5729. 

The plane derived in this way was ca l led the Funda- 
mental Plane (FP) by lLara-Lopez et aL ( 2010ah and is 
represented in Fig. [2 in b lue. The difference betwe en 
Eq.|4]and that presented in Lara-Lopez et al. ( 2010al) is 
that here we are including galaxies at higher redshifts, 
up to z ~ 0.35. This improves the sampling of the high 
mass galaxy population, thus slightly changing the ori- 
entation of the plane. A detailed discussion of redshift 
and mass completeness will be given in Lara-Lopez et 
al. (in preparation). 

Although the FP (Eq. [U is defined to minimize the 
variance in the estimate of M*, we also tested it to 
estimate the metallicity and SFR of galaxies by rear- 
ranging Eq. Hto solve for the other variables (Fig. [4]). 
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Fig. 2. — Different orientations of the 3D space formed by M*, SFR, and Z. The metallicity is always kept on the 
vertical axis, the cube is just moved rightwards, then tilted forward slightly, in going from the top to the bottom panels. 
The left panels show our newly derived FP (shaded ), while the co lored square points show the median metallicity 
taken in bins of SFR and M+ (as for the FMR, Mann ucci eValJuO lO). Square points are color-coded from low (red) to 
high (blue) SFR. Black data points are the full sample (§|2]i. The vectors show the first PCA component in yellow, the 
second in green, and the third in blue. The right panels show the same orientation and information as the left panels 
but omitting the underlying sample of SDSS galaxies. Upper panels show a face-on view of the FP, middle panels 






Fig. 6. — Plane fitted to SFR using regression, SFR=/(Z, M+). 
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0.34 


0.25 


1.0 


0.1 


0.94 


0.58 


M*=/(Z, SFR) (FP) 


0.29 


0.18 


0.60 


0.13 


0.52 


0.32 


Z = f(M*, SFR) 


0.44 


0.29 


0.97 


0.1 


1.75 


0.78 


SFR= /(M*,Z) 


0.29 


0.24 


1.17 


0.28 


0.68 


0.24 


FMR 






0.88 


0.1 







Table 1 : Summary ofx 2 ed and cr of the residuals for M*, metallicity and SFR using planes derived by PCA, regression, 
and the FMR. 



Estimating xj, as before, we see that the metallicity 
obtained through the FP gives - 0.60. Fig. HJ} 
also demonstrates that the estimate of Z follows a lin- 
ear behaviour over the full metallicity range. We also 
tested this FP to estimate the SFR of galaxies, shown 
in Fig.HJ;, obtaining x 2 red = 0.52, which is an improve- 
ment compared to the^ , = 0.94 obtained for the SFR 
through PCA. Again, Fig. 2J; shows a more linear re- 
lationship through the full range of SFR compared to 
PCA. 

To test whether the choice of M* as the dependent 
variable in Eq.|4]is the optimal approach, we also ana- 
lyzed the two other possible planes, fitting in turn to Z 
and SFR (Fig. Hand Fig.©. 

Fitting a plane to Z as a function of SFR and M+, 
we obtain: 



12 + log(0/H) = a z [MJMe] +/? z [log(SFR)] + y z (5) 

where a z = 0.3504, & = -0.1289, y z = 5.4882, 
giving x 2 d = 0.97 for the metallicity. To compare this 
plane with the others, we again also estimate and 
SFR by rearranging Eq.|5] We find^ = 0.44 for M*, 
and^, = 1.75 f or SFR (Fig.0. 

We also fit SFR as a function of Z and M* to obtain: 



log(SFR) = a s [M*/M ] + /3 S [12 + log(0/H)] + y s (6) 

where a s = 0.9924, yS. s = -0.8511, y s = -1.9167, 
and giving x 2 red - 0.68 for the SFR. Again, we estimate 
M* and Z by rearranging Eq. [6] This gives x 2 ed = 0.29 
for M*, and^ = 1 . 17 for Z. 

4.3. Binning data 



Following Mannucc i et al. d2010l) . we generated a 



grid. The resulting values are shown in Fig. [2] This 
figure only shows median values for those bins con- 
taining at least 50 galaxies. It can be seen that, despite 
the different metallicity estimator, w e can reproduce 



the sh ape of the surface obtained by Man nucci et al 



(12010b . By showing the underlying data used in deriv- 
ing this surface (left panels of Fig. [2j, it becomes clear 
that the curvature in this surface is not representative 
of the actual data distribution. 

To compare how accura tely the FMR can repro- 
duce metallicity we follow lYates et al.1 (12.0121) . since 
they use the same SDSS measurements of metallic- 
ity, SFR, and M+ as in the current work. We used 



grid of 0.1 1 dex in log(SFR), and 0.15 dex in M* and 
estimated the median metallicity in every square of the 



Eq. (1) of I Yates et ail (12012b to estimate // a =log(M+) 
- a log(SFR), with a - 0.19. We then estimated 
metallicity using Eq. (2) of the same paper, given by 
12+log(0/H) = 43.4476 - 12.1927* + 1.37284.x 2 - 
0.0498564x 3 , with x = ii a . 

The comparison between the metallicity obtained 
through the FMR and the original value gives x 2 d - 
0.88. (Fig. ig. 

4.4. Results 

The summary in TableQ]indicates that the plane that 
best minimizes x 2 red for M+, Z, and SFR, is the plane 
corresponding to the regression on M*, the FP. Also, 
this plane is the only one that maintains linearity over 
the full range of values between the observed and the 
estimated values in each projection. This further con- 
firms that a plane is a good representation of the whole 
3D distribution. 

Empirically, we can imagine that the M*-Z, 
M*-SFR, and Z-SFR relationships are the faces of 
this 3D distribution. The relationship that shows the 
highest dispersion (§[5) is the Z-SFR (ILara-Lopez et al. 
201033), which means that this relation is close to the 
face-on orientation of the 3D distribution (top panels 



8 



o 

K 9.2 



CO 



9.0 



8.8 



CO 



8.6 



X\ ed = 0.88 



E 0.4 
Z 0.2 



8.4 L 
8.4 



-1.0-0.5 0.0 0.5 1.0 
residuals 



8.6 



9.0 



9.2 



12+log(0/H) 



Fig. 7. — Metallicity of SDSS galaxies compared to 
the metallicity estimated through the FMR. The solid 
line shows the one to one relation. 

of Fig. ID. 

While M+ correlates with both SFR and metallicity 
(the well known M*-Z and M*-SFR relationships), 
the SFR does not strongly correlate with metallicity. 
Therefore, SFR and Z are the least correlated variables, 
and a linear combination of these two parameters is 
enough to explain all three variables. 

5. The Z-SFR relation 

We turn now to a discussion of the Z-SFR relation, 
and the impact of taking the medians of the data in 
bins defined in different order. We have just described 
the Z-SFR relation as being close to the face on view 
of the FR and the correlation between Z and SFR is 
not intrinsically tight. In consequence, any fit will suf- 
fer from a high degree of intrinsic scatter in the data 
around the fit. 

This statement can be quantified using the Pearson 
correlation coefficient, which is a test of correlation 
between two variables, and is defined as c — {<r 2 a x cr 2 b ) 
I (cr a x (Tb). The quantity c is a scalar in the interval 
[—1.0, 1.0], where —1.0 and 1.0 indicate a negative or 
positive perfect fit, respectively, while values close to 
0.0 would indicate a poor correlation. Applying this 
test to our relationships, we obtain c = 0.72 for the 
M*-Z relation, c = 0.76 for the M*-SFR relation, 
and c = 0.48 for the Z-SFR relation. Therefore, of 
our three relationships, the Z-SFR is the relation that 
shows the highest dispersion. 



Tak ing Z as the key quantity, iMannucci et al. 
(2010) studied the M+ dependence of the Z-SFR re- 
lation by first taking bins of M+ and then estimating 
the median Z in SFR bins. This procedure can be 
thought as the projection of the FMR on the M*-SFR 
face of the 3D distribution. The result is shown in 
Fig- Et- Although this relation has a high scatter, there 
is a tendency for the SFR generally to increase with 
Z. Fig. [8^ shows, however, that binning as a function 
of mass reveals a more subtle effect, with the resulting 
median values showing opposing trends depending on 
the mass selected. While the median metallicity in- 
creases with increasing SFR for log(M*) > 10.5, it 
decreases with increasing SFR for log(M*) < 10.5 
(Fig. EJi). 

The mass dependence of the Z-SFR relation can 
alternatively be explored by binning in a different or- 
der. For every M+ bin, we can estimate the median 
SFR in metallicity bins. Fig. [HJ? clearly shows that this 
binning order gives an apparently different result. For 
every M+, the SFR is almost flat or slightly increas- 
ing with the metallicity. There is a crucial distinction 
to be made here, which is one of correct interpreta- 
tion. In the former case, the median metallicity has 
been estimated for a given M* and SFR, while in the 
latter, the median SFR has been estimated for a given 
Mi, and Z. It is true to say from the former approach 
that, at a given mass, as SFR increases, the median 
metallicity either increases (high-mass) or decreases 
(low-mass). It is also true to say from the latter ap- 
proach that, at a given mass, as metallicity increases, 
the median SFR either increases (high-mass) or stays 
relatively constant (low-mass). These statements are 
not inconsistent with each other. The confusion arises 
when trying to impose an interpretation that is incon- 
sistent with the motivation for binning in a particular 
order. It would not be true to conclude from Fig.[8k, for 
example, that the median SFR decreases with increas- 
ing metallicity (low-mass) or increases with increasing 
metallicity (high-mass), since the median SFR has not 
been calculated here. 

We emphasize here that when fitting a relation to 
a distribution of data, the full dataset should be used. 
Fitting only to a representation of the full dataset that 
is derived from the median values of one parameter in 
bins of the others will clearly result in different sur- 
faces being derived, and will depend on the choice of 
the parameter for which the median is estimated. 

A detailed study to tease apart the complex inter- 
play of AT*, Z, SFR, and SSFR that exploits this use of 
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different binning order for all the relationships will be 
presented in Lara-Lopez et al. (in preparation). 

6. Discussion and conclusions 

We have discussed the metallicity estimator of N06 
showing that it is not accurate at high metallicities due 
to the saturation of the N2 method in the high metallic- 
ity regime. The use of a reliable metallicity estimator 
is crucial when analyzing t he SFR dependenc e of the 
M*-Z relation as shown in I Yates et alj (12012I) . 

The emission-line galaxy spectra from SDSS are 
high quality, and measurements for many emission 
lines are available, making it possible to determine 
the gas-phase metallicity more robustly by applying 
techniques which consider the ioniz ation degree of th e 
gas. Examples of t hese methods are McGaughl(ll99ll): 
iKewlev & Dopitald2002l);lKobulnickv & Kewlevl(l2004t 
and iTremonti et alJ (120041) ("which are ba se d on pho- 



marily aimed at identifying the most concise represen- 
tation of M+, Z and SFR in order to facilitate more de- 
tailed exploration of the interplay between these prop- 
erties of galaxies. Nevertheless, in cases when more 
robust techniques to estimate the stellar mass (e.g., 



Taylor et al. 201 1) are not available, the use of the FP 



toioniz ation models) and Pi 
d2005l) and IPilvugin et al. 



vugiiJJ2001allbT) : IPilyugin & 
d2010l) (which rely on 



Thuanl 



datasets for which Z is known using the T e method). 

We analyzed the 3D distribution of M*, Z, and 
SFR using three different approaches: (z) fitting a 
plane using PCA, (if) fitti ng a plane through regression 
( Lara-Lopez et al. 2010al) . and (Hi) binning in SFR 
and M+ to obtain the median metallicity of each bin 
(Man nucci et al ' 7 or the five methods used, we 

estimated the x 2 red as a measure of goodness of fit (Ta- 
ble [TJ. We find that the best representation of the data 
is the plane defined by regression on . 

We compare the Mannucci et al. (2010) surface 
(the FMR) and the lLara-Lopez et al] d2010ah Fun- 
damental Plan e (FP), and dem onstrate that the FMR 



could be used to estimate the stellar mass, being care- 
ful to take into account the metallicity and SFR uncer- 
tainties. 

Our analysis of the Z-SFR relation with the two 
approaches toward binning the data highlights a cru- 
cial need for caution in interpretation when exploring 
distributions represented as median values. The inap- 
propriate interpretation of such results will lead to ap- 
parently contradictory conclusions, depending on the 
binning order used. Furthermore, presenting medians 
in bins as a three-dimensional distribution will lead to 
differing representations of the data, depending on the 
binning order chosen. 

The SFR of a galaxy relates to the amount of gas 
currently being converted into stars, and correlates 
with the current mass in stars, while metallicity is a 
measure of the number of times that the gas has been 
reprocessed by stars, and also correlates with the cur- 
rent mass in stars in a galaxy. The fact that we can rep- 
resent M+ as a linear combination of SFR and metal- 
licity suggests that the stellar mass of a galaxy can be 
thought as the rate at which a galaxy is currently form- 
ing stars (SFR), plus a measure of the star formation 
history, which is here represented by the metallicity 
(Z), corresponding to the amount of reprocessing of 
the gas by past stellar generations. The SF history and 
current SFR of a galaxy are closely linked to M*. 



proposed by Mannucci et al. ( 2010h does not pro 



vide the best representation of the data. While PCA 
also does not provide the best representation, it does 
demonstrate that the 3D distribution can be adequately 
represented in 2 dimensions (a plane). The M*-Z, 
M*-SFR, and Z-SFR relationships are then projec- 
tions of this plane. W e highlight here that using the 
iMannucci et al. ( 2010l) FMR to estimate metallicities 
from M+ and SFR can result in large errors, up to 
~ 0.2 dex in oxygen abundance (Fig. |7J, being larger 
at high metallicities as a c onsequence of the inappro- 
priate use of the Nj index ([Lopez-Sanchez & Estebanl 



To f urther support this result, lLara-Lopez et al 



2010; iLopez-Sanchez et aljhoilh 



We also highlight that the plane found by the regres- 
sion fit on M+ is not developed as a new technique for 
stellar mass estimation. Rather, this approach is pri- 



(l2010al) demonstrated that high redshift data agree 
with this FP, with a lack of evolution in the FP up 
to z ~ 3.5. This is explained by SFR and metallicity 
evolving in opposite directions: while high redshift 
galaxies show higher SFR than local galaxies, they 
also present lower metallicity abundances than local 
systems. These differences make it possible to re- 
cover, independently of redshift, the M+ of a galaxy as 
a combination of SFR and Z. 

We warmly thank the referee for extensive com- 
ments that have led to significant improvement in this 
paper. We thank Mercedes Molla, Cesar Esteban, 
Alessandro Ederoclite, and Damaso E. Chicharro- 
Martmez for useful comments. M. A. Lara-Lopez 
thanks the "Summer School in Statistics for Astronomers' 



10 




Fig. 8. — (a) Filled coloured circles correspond to the Z-SFR relation binned following IMannucci et al. (2010), who 
took bins in log(SFR) and estimated the median 12+log(0/H) per bin for different ranges of log(M*/M ), as shown in 
the color bar. (b) The same data, but now taking bins of 12+log(0/H) and estimating the median log(SFR) using the 
same mass ranges as in (a). Black dots and white contours correspond to the SDSS sample. Yellow triangles show (a) 
the median metallicity in SFR bins, or (b) the median SFR in metallicity bins, considering the whole sample. 



Center for Astrostatistics, PennState, especially to 
Arnab Chakraborty for invaluable tutorials on "R" and 
PCA. 
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